A Scalable Cross-Language Metasearch Architecture* for Multilingual Information Access on the Web

نویسندگان

  • Yoshihiko Hayashi
  • Genichiro Kikui
  • Toshiaki Iwadera
چکیده

This position paper for the special session on "Multilingual Information Access" comprises of three parts. The first part reviews possible demands for Multilingual Information Access (hereafter, MLIA) on the Web, and examines required technical elements. Among those, we, in the second part, focus on Cross-Language Information Retrieval (hereafter, CLIR), particularly a scalable architecture which enables CLIR in a number of language combinations. Such a distributed architecture developed around XIRCH project (an international joint experimental project currently involves NTT, KRDL, and KAIST) is then described in a certain detail. The final part discusses some NLP/MT related issues associated with such a CLIR architecture.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Web-Based Information Access: Multilingual Automatic Authoring

The needs for managing similar documents in different languages increases with the growing amounts of electronic information available in documents of the same type (e.g. news streams). This paper proposes a viable approach to information access emphasizing the hypertextual paradigm in a multilingual framework. This task of processing/structuring text so that cross-lingual hypertext links are g...

متن کامل

Modern Multilingual and Cross-lingual Information Access Technologies

In this chapter, we describe the state of the art cross-lingual and multilingual strategies and their related areas. In particular, we show a WWW-based information system called MIETTA, which allows uniform and multilingual access to heterogeneous data sources in the tourism domain. The design of the search engine is based on a new cross-lingual framework. The framework integrates a cross-lingu...

متن کامل

Architectural Design of WebScales - A Large-Scale Metasearch Engine

It is estimated that there are hundreds of thousands of information sources on the Web, including both the Surface Web and the Deep Web. Most of these sources have their own search capabilities. In order to alleviate ordinary users from the formidable task of identifying useful sources and search them individually, it is important to provide a unified access to these sources. Metasearch engine ...

متن کامل

Challenges for the multilingual Web of Data

The Web has witnessed an enormous growth in the amount of semantic information published in recent years. This growth has been stimulated to a large extent by the emergence of Linked Data. Although this brings us a big step closer to the vision of a Semantic Web, it also raises new issues such as the need for dealing with information expressed in different natural languages. Indeed, although th...

متن کامل

Exploring the Effects of Language Skills on Multilingual Web Search

Multilingual access is an important area of research, especially given the growth in multilingual users of online resources. A large body of research exists for Cross-Language Information Retrieval (CLIR); however, little of this work has considered the language skills of the end user, a critical factor in providing effective multilingual search functionality. In this paper we describe an exper...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007